ABSTRACT

A brief résumé of the role of linear utility functions in Game Theory
is given. The point is made that, in practical applications, a
knowledge of these functions is usually not available, and hence much
of the rationale of the game theoretic approach to competitive problems
is lost. The random character of the "real" payoff of a matrix game is
then discussed, and the probability distribution function of the payoff
is derived. The dependence of this distribution function upon the mixed
strategies of the players is shown. Criteria are developed to provide a
definition of "optimal mixed strategy" in terms of the effect on the
distribution function. The mathematical formulation of the solution is
given for each criterion discussed.

The reader will require knowledge of the elements of Probability
Theory, Game Theory, Utility Theory, and Linear Programming.

1. Introduction.



The purpose of this paper is to investigate a difficulty that inevitably
exists whenever an application of the mathematical Theory of Games
is made in a non-idealized situation. We shall restrict our attention to
the class of two-person matrix games, although the basic question arises 
in connection with n-person and continuous games as well. Briefly stated, 
the problem is as follows: 

In the logic of the Theory of Games, a vital part is played by the 
so-called "linear" utility function, a real-valued function defined over 
the space of outcomes associated with a game. In actual practice, it is 
very doubtful that this function will be available to, let us say, a mil- 
itary operations analyst who wishes to formulate a practical problem as 
a game. On the other hand, when the situation involves a conflict of
interest between intelligent antagonists, with an element of chance
affecting the outcome of any course of action, it is difficult to find a
concept of "rational procedure" or "optimal decision" which does not
involve the game theoretic approach. Therefore the question becomes: in
the absence of an important part of its machinery, what adaptations of
Game Theory can provide us with reasonable solutions to problems of
practical importance?

In standard texts on the Theory of Games, it is usual to present a 
discussion of utility theory at an early stage. The axioms leading to 
construction of the linear utility function (hereafter referred to as 
the l.u.f.) are developed, and once this has been done, it is customary
to assume that the appropriate l.u.f.'s are known, and proceed with the
mathematical development of the theory.

We wish to change this procedure slightly, in order to do two things:

(1) Clarify the theoretical role of the l.u.f. in the logic of Game
Theory.

(2) Emphasize the vital importance to the military operations analyst
of a knowledge of appropriate l.u.f.'s, when formulating "real" problems
in game-theoretic terms.

2. Typical formulation of a competitive situation as a matrix game. 
Distribution of the payoff. 

Suppose that a situation involving conflict of interest between two 
parties ("player I" and "player II") has been analysed so as to yield the
following quantities: 

(1) The space of all pure strategies available to player I. 

(2) The space of all pure strategies available to player II. 

(3) A matrix, the elements of which describe what happens when player I
follows his i-th pure strategy, and player II follows his j-th pure
strategy (i = 1, 2, ..., m; j = 1, 2, ..., n).

We shall assume that the outcome, or payoff to player I, resulting
from the choice (i,j) of pure strategies, can be measured in terms of a
single "real commodity", e.g., money, or casualties, etc.¹

The first important point we make here is that nothing in Game
Theory requires that a fixed real-commodity payoff to player I be
associated with the pair (i,j). Rather, the requirement is this: the
payoff to player I resulting from the choice (i,j) is a random variable
\(X_{ij}\), the distribution function of which is known to both players.
Denote this distribution function by \(F_{ij}(x) = P[X_{ij} \le x]\). This
leads us to the following Definition: \(F(x) = \|(F_{ij}(x))\|\) is a matrix
of known distribution functions. This matrix describes in probabilistic
terms the payoff to player I associated with each possible choice (i,j)
of pure strategies by the players.

1. This restriction simplifies the notation without affecting the logic
of the argument.

Now assume that the players wish to employ mixed strategies.

\(Q' = (q_1, q_2, \ldots, q_m)\) is a strategy of player I.

\(P' = (p_1, p_2, \ldots, p_n)\) is a strategy of player II.

When the mixed strategies Q and P are used, we can determine the
probability distribution function of the payoff to player I as follows:

Define Y to denote the real commodity payoff to player I when the
mixed strategies Q and P are used in conjunction with the matrix F(x).
Then

\[ G(y) = P[Y \le y] = \sum_{i=1}^{m} \sum_{j=1}^{n} q_i p_j F_{ij}(y) . \qquad \text{(eq. 1)} \]

This can be written in matrix notation as:

\[ G(y) = Q'F(y)P . \qquad \text{(eq. 2)} \]

Looking at equations 1 and 2, we can see that the distribution of
the "real" payoff to player I involves two types of parameters: (1) those
associated with the distributions \(F_{ij}(x)\), which are assumed given and
unchangeable, and (2) the elements \(q_i\), \(p_j\) of the matrices Q and P.
These are under the control of the players.

We can now formulate the objective of player I in choosing his mixed
strategy Q. Player I wishes to choose Q so as to optimize, in some sense,
the form of the distribution function G(y). This interpretation of the
role of a mixed strategy is a central feature of this paper.
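As an illustration of equation 2, the following minimal sketch evaluates G(y) = Q'F(y)P for a small game. Everything numerical in it is an assumption made for the example: the 2 x 2 size, the normal payoff distributions in `PARAMS`, and the mixed strategies `Q` and `P` do not come from the paper.

```python
import math

def normal_cdf(x, mean, sd):
    """Normal distribution function; the normal family is only an illustrative choice."""
    return 0.5 * (1.0 + math.erf((x - mean) / (sd * math.sqrt(2.0))))

# Hypothetical 2 x 2 game: (mean, standard deviation) of the payoff X_ij.
PARAMS = [[(0.0, 1.0), (2.0, 1.0)],
          [(1.0, 3.0), (1.0, 0.5)]]

def F(x):
    """The matrix F(x) = ||(F_ij(x))|| evaluated at the point x."""
    return [[normal_cdf(x, mean, sd) for (mean, sd) in row] for row in PARAMS]

def G(y, Q, P):
    """Equation 2: G(y) = Q'F(y)P = sum over i, j of q_i p_j F_ij(y)."""
    Fy = F(y)
    return sum(Q[i] * P[j] * Fy[i][j]
               for i in range(len(Q)) for j in range(len(P)))

Q = [0.25, 0.75]   # mixed strategy of player I (assumed)
P = [0.60, 0.40]   # mixed strategy of player II (assumed)
grid = [-10.0, 0.0, 1.0, 2.0, 10.0]
values = [G(y, Q, P) for y in grid]
for y, g in zip(grid, values):
    print(f"G({y:6.1f}) = {g:.4f}")
```

Changing `Q` while holding `PARAMS` fixed changes the whole curve G(y), which is the sense in which a mixed strategy "shapes" the payoff distribution.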
At this point the theory of the l.u.f. enters the picture. In the
next section we see how the l.u.f. provides a criterion whereby each player
may make a preferential ordering of his possible mixed strategies, and
select an optimal one.

3. The theoretical and practical roles of the l.u.f. 

Appendix A presents for quick reference the axiomatic development of
the l.u.f., largely following the excellent treatment in Luce and Raiffa
[2]. Appendix B gives a very slight extension of the idea, primarily so
that we can talk about a continuous l.u.f. u(x) defined over an
"infinitely divisible" real commodity.²

In this section, we do no more than write down some conclusions and
interpretations concerning the l.u.f. Appendices A and B are, hopefully,
arranged so that the reader who wishes to check the reasoning behind the
statements made here may do so easily.

The pertinent conclusions concerning the l.u.f. are the following: 

(1) The axiomatic design of a player's l.u.f. is such that it calls 
on information not available in the matrices F(x), Q, and P, in order to 
measure the true value that a real commodity has for the player, vis-a-vis 
the gambles involved in a risky situation (i.e., the game). 

(2) Suppose a player enters a gamble of the following form: he will
win (or lose) a random amount X of some commodity, where X has the known
distribution function H(x). This is called a "simple" gamble. If the
player has a l.u.f. u(x) defined over the commodity, he can compute the
true value v which the simple gamble has for him by forming the
expectation

\[ v = E[u(X)] = \int_{-\infty}^{\infty} u(x)\,dH(x) . \]

(3) Suppose the player is involved in a "compound" gamble; that is,
with probability \(p_k\) he enters a simple gamble \(H_k(x)\),
k = 1, 2, ..., n. The true value v to the player of this compound gamble
can be computed in terms of his l.u.f. as¹

\[ v = \sum_{k=1}^{n} p_k \int_{-\infty}^{\infty} u(x)\,dH_k(x) . \]

(4) Even though the operation involved in (2) and (3) above is that
of taking an expected value of u(X), the interpretation of v as a "true
value" does not depend upon there being many repetitions of the gamble.
The l.u.f. is so designed that it measures the value of a one-time gamble.

(5) Once a player has determined what his l.u.f. u(x) is for a certain
commodity x, he can use u(x) to compare any two gambles involving x.
If \(H_1(x)\) and \(H_2(x)\) are the distribution functions associated with
any two such gambles, simple or compound, the player computes

\[ v_1 = \int_{-\infty}^{\infty} u(x)\,dH_1(x) , \qquad v_2 = \int_{-\infty}^{\infty} u(x)\,dH_2(x) . \]

The gamble involving \(H_1(x)\) is preferred to the gamble involving
\(H_2(x)\) if and only if \(v_1 > v_2\). By using u(x) in this way, the
player obtains a complete and transitive ordering of his preference for
all possible gambles over the commodity x.

These are the five conclusions which are of most interest to us.
Next, we make some interpretations, based on these conclusions.

1. This property of u(x) contributes the unfortunate word "linear" to
the name of the l.u.f.

2. The notation tends to be simpler when discussing this case.
(1) When the players in a two-person matrix game make a choice of
pure strategies, a "simple" gamble results. From player I's point of view,
this gamble involves the random variable \(X_{ij}\), with distribution
function \(F_{ij}(x)\).

(2) When the players employ mixed strategies Q and P, player I faces
a "compound" gamble involving the random variable Y with distribution
function G(y), as given by equation 1.

(3) Recall our earlier observation that by his choice of a mixed
strategy Q, player I exerts himself to determine a form of G(y) which is
in some sense optimal for him. We now interpret the theoretical role of the
l.u.f. in the logic of Game Theory to be: the l.u.f. is a
criterion-providing device that enables a player to compute which form of
G(y), and hence which mixed strategy, is best for him.

(4) Suppose that a matrix game involves a real commodity x for which
the players know their l.u.f.'s to be \(u_1(x)\) and \(u_2(x)\)
respectively. Suppose further that \(u_2(x) = -u_1(x)\). This is of course
the so-called "zero-sum" assumption. If the players use mixed strategies Q
and P, we write the distribution of the real payoff Y as G(y) = G(y;Q,P)
to emphasize the dependence of this function upon the players' choices. It
now follows from the discussion under conclusion (5) above, that for
extreme conservatism the optimal choices
\(Q_0' = (q_1^0, \ldots, q_m^0)\) and \(P_0' = (p_1^0, \ldots, p_n^0)\) of
mixed strategies are defined by the equation

\[ \int_{-\infty}^{\infty} u(y)\,dG(y;Q_0,P_0)
   = \max_Q \min_P \int_{-\infty}^{\infty} u(y)\,dG(y;Q,P)
   = \min_P \max_Q \int_{-\infty}^{\infty} u(y)\,dG(y;Q,P) . \qquad \text{(eq. 3)} \]

Written in this form, the definition of optimal \(Q_0\), \(P_0\) looks
unfamiliar and rather formidable. However, referring to equation 1, we can
re-write equation 3 as:

\[ \sum_{i,j} q_i^0 p_j^0 \int_{-\infty}^{\infty} u(y)\,dF_{ij}(y)
   = \max_Q \min_P \sum_{i,j} q_i p_j \int_{-\infty}^{\infty} u(y)\,dF_{ij}(y) . \qquad \text{(eq. 4)} \]

The "linear" property of u(x) discussed under conclusion (3) above makes
it proper for us to define

\[ a_{ij} = \int_{-\infty}^{\infty} u(y)\,dF_{ij}(y) , \]

and to re-write equation 4 as

\[ \sum_{i,j} q_i^0 p_j^0 a_{ij} = \max_Q \min_P \sum_{i,j} q_i p_j a_{ij} . \qquad \text{(eq. 5)} \]

By introducing the utility matrix \(A = \|(a_{ij})\|\), we can reduce
equation 5 to the familiar form:

\[ Q_0'AP_0 = \max_Q \min_P Q'AP = \min_P \max_Q Q'AP . \qquad \text{(eq. 6)} \]

Thus we have seen, step by step, how knowledge of a player's l.u.f.
for a commodity x lies at the center of the famous minimax formulation of
the zero-sum game solution, as given in equation 6. The essential
observation to make is: if the l.u.f.'s are not known, the rationale
leading to equation 6 as a definition of optimality breaks down, for there
can be no guarantee that the players are maximizing (minimizing) the "true
value" of the compound gamble represented by the game. In particular, it
is not in general correct to replace \(a_{ij}\) by the quantity
\(m_{ij} = \int_{-\infty}^{\infty} x\,dF_{ij}(x)\), and to maximize
(minimize) the expected real value of the game. To do so ignores the
entire scheme for evaluating gambles that is provided by the
construction of the l.u.f.
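A small numerical sketch may make the last point concrete. It compares the minimax solution computed from a utility matrix A with the one computed from the matrix of mean payoffs M. The payoff distributions in `GAMBLES`, the square-root utility `u`, and the helper `solve_2x2` (the textbook closed form for 2 x 2 zero-sum games) are all assumptions introduced for this example, not part of the paper.

```python
import math

def expectation(gamble, f=lambda x: x):
    """E[f(X)] for a discrete gamble given as (probability, amount) pairs."""
    return sum(p * f(x) for p, x in gamble)

def solve_2x2(A):
    """Maximin mixed strategy and value for the row player of a 2 x 2 zero-sum game."""
    (a, b), (c, d) = A
    row_mins = [min(a, b), min(c, d)]
    col_maxes = [max(a, c), max(b, d)]
    maximin, minimax = max(row_mins), min(col_maxes)
    if maximin == minimax:            # saddle point: a pure strategy is optimal
        i = row_mins.index(maximin)
        return ([1.0, 0.0] if i == 0 else [0.0, 1.0]), float(maximin)
    denom = a - b - c + d             # non-zero when there is no saddle point
    q1 = (d - c) / denom
    return [q1, 1.0 - q1], (a * d - b * c) / denom

# Hypothetical payoff distributions F_ij: three sure amounts and one risky cell.
GAMBLES = [[[(1.0, 4.0)], [(1.0, 1.0)]],
           [[(1.0, 0.0)], [(0.5, 0.0), (0.5, 16.0)]]]

u = math.sqrt   # assumed l.u.f. of a risk-averse player I

A = [[expectation(g, u) for g in row] for row in GAMBLES]   # a_ij = E[u(X_ij)]
M = [[expectation(g) for g in row] for row in GAMBLES]      # m_ij = E[X_ij]

Q_utility, value_utility = solve_2x2(A)
Q_mean, value_mean = solve_2x2(M)
print("optimal Q from utility matrix A:", Q_utility)
print("optimal Q from mean matrix M:   ", Q_mean)
```

Under these assumptions the two matrices lead to different mixed strategies for player I, so a player who optimizes the expected real payoff is not, in general, optimizing the "true value" of the game.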



This completes our review of the role of the l.u.f. and its practical
importance to the operations analyst who wishes to use the "customary"
approach to the solution of a matrix game, as represented in the zero-sum
case by equation 6.

In the next section, we discuss procedures for handling a competitive
situation when a knowledge of l.u.f.'s is not available. It seems
appropriate, however, to first answer the question, "Why not devote the
effort instead to the problem of determining the l.u.f. experimentally,
rather than adopting the pessimistic attitude that appropriate l.u.f.'s
are not likely to be known?"

The answer, of course, is that a great deal of work has been done in
an effort to measure representative l.u.f.'s, and the results have been
largely discouraging, not to say unbelievable. See, for instance, [3]
and [4]. Since experiments performed under controlled and simplified
laboratory conditions have failed to yield satisfactory results, it does 
not seem reasonable to assume that the military operations researcher, 
working with far more complex problems of value, will be able to isolate 
functions which truly describe how responsible military commanders evaluate 
difficult situations in terms of "linear utilities". As a practical matter, 
it seems much more sensible to look for procedures which make some sense 
in the absence of knowledge about linear utilities. 

One more point should be made before we go on. If we are going to
abandon all knowledge of utility functions and attempt to "solve"
competitive situations (games) using only the distribution function G(y)
as a point of departure, we must clearly understand what we can and cannot
hope to accomplish.

Consider the situation in which a matrix game has been formulated, and
a knowledge of appropriate l.u.f.'s assumed. In this case, if the payoff
is zero-sum, the assertion of Game Theory is that there is a logical and
highly plausible definition of "optimum solution" available, and a scheme
for finding it.¹ In other words, we are guaranteed the optimum result.

Contrast this with the situation that exists when no knowledge of 
the l.u.f. is assumed. This lack of knowledge means that we do not know how 
to value the commodity for which we are gambling, vis-a-vis the risks in- 
volved. It is clearly unrealistic to hope that, in this condition of ignor- 
ance, we shall be able to characterize any decision we make as the optimal 
one. We can hope to formulate criteria which may be "plausible" and 
"reasonable" when applied to specific situations, and we can hope that these 
criteria, through their mathematical formulation, will lead to decisions 
which are "approximately optimal". This is of course a very loose state- 
ment, since we do not have, and will not formulate, a means of measuring 
"distances" between a theoretical optimum and any other point in the space 
of possible solutions. We are simply proceeding in accordance with the be- 
lief that it is better to have some well-understood criterion for the good- 
ness of a decision, and some systematic scheme for computing what that 
decision should be, keeping in mind all the potential shortcomings, than 
to have none at all and simply proceed hit-or-miss. 

1. We have not discussed such schemes so far, but the "simplex method"
may be regarded as a general algorithm for solving this class of games.

4. Criteria for the solution of matrix games in the absence of linear
utility functions.

Consider first the case of a game that is going to be played one time
only. For this case, we shall formulate solutions that are approximately
optimal under the following criteria:



Criterion A: That mixed strategy is best for player I which
maximizes the probability that the real-commodity payoff will
lie in any one of a finite number of given, disjoint intervals,
assuming that player II acts so as to minimize this probability.

Criterion B: That mixed strategy is best for player I which

i) maximizes the probability that the real-commodity
payoff will exceed a given amount b, subject to the condition
that

ii) the probability that the payoff is less than a given
amount c shall be less than a given figure a, no matter what
strategy player II uses. Assume that player II wishes to
minimize the probability that player I receives more than b.

These are certainly not the only criteria one could consider. How- 
ever, they have features which are of practical value. Three obvious 
special cases of Criterion A call for 

(1) maximizing the probability that the real payoff exceeds a given 
value (perhaps some "critical" value), 

(2) minimizing the same probability, 

(3) maximizing the probability that the payoff lies in some specified 
interval. 

Certainly one can think of practical situations in which one of these
might be a very reasonable course of action to follow. What is more
important, it is not hard to imagine that a military commander might have
just enough information to decide on one or two "critical" values, when a
utility measure over all possible outcomes would be impossible to find. As
an example, the commander who decides, "I must at all costs destroy at
least 80% of the enemy fleet in this engagement", has adopted special case
(1) above as his criterion for an optimal course of action. A more homely
example would be an expert poker player who decides, "I want to win at
least twenty dollars in this game, but I'd better not win more than fifty,
or these people might not let me fleece them in the future." The poker
expert is looking toward special case (3).

Obviously, Criterion A is worth considering. The general formulation
is no more difficult than the special cases, as will be seen.¹

It is a little harder to read sense into Criterion B at first glance,
but an example will make it clear that it too has a plausible, common-sense
basis. Suppose a businessman has the following thoughts: "I would like
very much to make a profit of $100,000 or more this year. ($100,000 =
given amount b). On the other hand, I'll have to take some risks to do that.
I have some debts outstanding, my son enters Harvard this September, and I
want to vacation in Hawaii this year. If I show a profit of less than
$10,000, things are going to be tough. ($10,000 = given amount c). Well,
I'll try for the big profit, but only if I can work it so that my chances
of making less than $10,000 are small; say about 10%." (10% = given figure
a).

One can easily construct examples with a military flavor, involving
the value of an objective, and the risks and costs associated with achieving
it. As with Criterion A, the point is this: it is far more reasonable to
expect our businessman in the example to somehow get hold of the figures
$100,000, $10,000, and 10%, than it is to expect him to construct his
complete l.u.f. for money. As for asking the Joint Chiefs of Staff what
their combined l.u.f. for casualties in Southeast Asia might be, one
shudders at the thought.

1. It is also convenient to solve this problem for any finite number of
intervals, since solutions for all the more practical special cases follow
immediately.

In connection with both Criteria, we have made the assumption,
essentially, that player II's aims will be directly opposed to those of
player I.
Obviously, in many real situations this will not be true. When we deal 
with this difficulty, we get into the whole problem of non-zero-sum games, 
definitions of stable solutions, etc. 

Now, in the absence of l.u.f.'s, or any measure of "true value"
associated with real outcomes of a game, we lack the means for deciding
whether or not a game is zero-sum. Hence, we should really consider the
difficult non-zero-sum case. To get around this situation, we adopt the
following point of view, which should be regarded as part of Criteria A and
B, and therefore as an "input" to any solution stemming from them:

The most conservative policy that player I can follow is to assume 
that player II is an "implacable foe". In practical situations where this 
is largely true, all will be well, and incorporation of this assumption 
will be largely justified. When this assumption is not true, player I will 
get at least slightly better results than if player II were "implacably 
opposed", but will not be able to take maximum advantage of possible
"community of interest" arrangements, etc. In this latter case, the
procedures which we develop here probably will not be very good, and player I
should exercise his ingenuity to analyze the special features of the part- 
icular situation, in order to find a better solution. In other words, a 
practical application of these ideas requires a realistic look at the con- 
text in which they are to be applied, and no good solution can be expected 
from their use in inappropriate circumstances. 
5. Mathematical formulation of the solution under Criterion A.

We are still dealing with the case where the real payoff is
one-dimensional, that is, expressible in terms of a single real commodity x.
Following the previous notation, let Q and P be mixed strategies of
players I and II respectively, F(x) the matrix defined in section 2, and Y
the real payoff to player I resulting from choices of Q and P. Denote
the disjoint intervals of Criterion A by \(I_1 = [a_1, b_1]\),
\(I_2 = [a_2, b_2]\), ..., \(I_n = [a_n, b_n]\). Then

\[ P[Y \in I_1 \text{ or } Y \in I_2 \text{ or } \ldots \text{ or } Y \in I_n]
   = \sum_{k=1}^{n} \int_{a_k}^{b_k} dG(y)
   = \sum_{k=1}^{n} \int_{a_k}^{b_k} \sum_{i,j} q_i p_j \, dF_{ij}(y)
   \quad \text{(by equation 1)} \]

\[ = \sum_{k=1}^{n} \sum_{i,j} q_i p_j \left[ F_{ij}(b_k) - F_{ij}(a_k) \right] . \]

Using matrix notation (see equation 2), re-write this as

\[ Q'[F(b_1) - F(a_1)]P + Q'[F(b_2) - F(a_2)]P + \cdots + Q'[F(b_n) - F(a_n)]P \]
\[ = Q'[F(b_1) + \cdots + F(b_n) - F(a_1) - \cdots - F(a_n)]P . \]

In the last expression, denote the matrix
\([F(b_1) + \cdots + F(b_n) - F(a_1) - \cdots - F(a_n)]\) by H. This
is a matrix of real numbers, completely determined by F(x) and the intervals
\(I_k\), which are given quantities. Therefore, Q'HP is an ordinary bilinear
form, and under our assumption about the behaviour of player II, conservative
optimal mixed strategies \(Q_0\) and \(P_0\) are defined by

\[ Q_0'HP_0 = \max_Q \min_P Q'HP . \qquad \text{(eq. 7)} \]

The nice feature of this result is of course that all the machinery
for finding \(Q_0\) and \(P_0\) that one uses in connection with zero-sum
games is appropriately used here. The difference lies in the interpretation
of H as a matrix whose elements come from probability calculations, rather
than a utility function.
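The construction of H and the maximin problem of equation 7 can be sketched numerically as follows. The uniform payoff distributions in `RANGES` and the two target intervals in `INTERVALS` are invented for the example. The maximin is found by a coarse grid search over player I's mixed strategies, which is a stand-in chosen for brevity and works only because m = 2 here; it is not the simplex machinery the paper refers to.

```python
def uniform_cdf(x, lo, hi):
    """Distribution function of a payoff uniform on [lo, hi] (illustrative choice)."""
    return min(1.0, max(0.0, (x - lo) / (hi - lo)))

# Hypothetical 2 x 2 game: X_ij is uniform on the range given for cell (i, j).
RANGES = [[(0.0, 1.0), (1.0, 2.0)],
          [(1.0, 5.0), (0.0, 4.0)]]
INTERVALS = [(0.0, 1.0), (2.0, 3.0)]   # disjoint intervals [a_k, b_k] (assumed)

def build_H(ranges, intervals):
    """H_ij = sum over k of [F_ij(b_k) - F_ij(a_k)]."""
    return [[sum(uniform_cdf(b, lo, hi) - uniform_cdf(a, lo, hi)
                 for a, b in intervals)
             for (lo, hi) in row] for row in ranges]

def security_level(H, Q):
    """min over P of Q'HP; the minimum is attained at a pure strategy of player II."""
    n = len(H[0])
    return min(sum(Q[i] * H[i][j] for i in range(len(Q))) for j in range(n))

def maximin_grid(H, steps=1000):
    """Grid search over Q = (q, 1 - q); valid only for m = 2 rows."""
    best_Q, best_value = None, float("-inf")
    for k in range(steps + 1):
        Q = [k / steps, 1.0 - k / steps]
        value = security_level(H, Q)
        if value > best_value:
            best_Q, best_value = Q, value
    return best_Q, best_value

H = build_H(RANGES, INTERVALS)
Q0, value = maximin_grid(H)
print("H =", H)
print("approximate Q_0 =", Q0, " guaranteed probability =", value)
```

For larger games the grid search would be replaced by any standard method for zero-sum matrix games applied to H.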

In the special case of maximizing P[Y > c], where c is some "critical"
value, the result is

\[ P[Y > c] = \int_{c}^{\infty} dG(y)
   = \sum_{i,j} q_i p_j \int_{c}^{\infty} dF_{ij}(y)
   = \sum_{i,j} q_i p_j \left[ 1 - F_{ij}(c) \right] . \]

To put this in the more convenient matrix notation, define \(1_{mn}\) to
be an m-by-n matrix each element of which is unity, and write

\[ P[Y > c] = Q'[1_{mn} - F(c)]P . \]

Optimal strategies \(Q_c\) and \(P_c\) are defined by

\[ Q_c'[1_{mn} - F(c)]P_c = \max_Q \min_P Q'[1_{mn} - F(c)]P . \qquad \text{(eq. 8)} \]

An interesting sub-special case develops when the probability
distributions \(F_{ij}(x)\) of the matrix F(x) are degenerate, that is:

\[ F_{ij}(x) = 0 \ \text{ for } x < h_{ij} , \qquad F_{ij}(x) = 1 \ \text{ for } x \ge h_{ij} . \]

This corresponds to the case where the real-commodity payoff associated
with the choice (i,j) of pure strategies is a fixed amount \(h_{ij}\),
rather than random. In this case the matrix \([1_{mn} - F(c)]\) of equation
8 is a matrix in which each element is either zero or one. Choice of a
simplified criterion certainly leads in this case to a simplified problem
to be solved.
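A short sketch of the degenerate case: with fixed payoffs, the matrix of equation 8 reduces to an indicator of whether each payoff exceeds c. The payoff table `FIXED_PAYOFFS` and the critical value `c` are hypothetical.

```python
FIXED_PAYOFFS = [[3.0, -1.0],
                 [0.0,  2.0]]   # hypothetical fixed payoffs h_ij
c = 0.0                         # assumed "critical" value

def degenerate_cdf(x, h):
    """F_ij(x) for a payoff fixed at h: 0 for x < h, 1 for x >= h."""
    return 1.0 if x >= h else 0.0

# The matrix [1_mn - F(c)]: entry is 1 exactly when h_ij > c, otherwise 0.
one_minus_F = [[1.0 - degenerate_cdf(c, h) for h in row] for row in FIXED_PAYOFFS]

def prob_exceeds(Q, P):
    """P[Y > c] = Q'[1_mn - F(c)]P for mixed strategies Q and P."""
    return sum(Q[i] * P[j] * one_minus_F[i][j]
               for i in range(len(Q)) for j in range(len(P)))

print(one_minus_F)
print(prob_exceeds([0.5, 0.5], [0.5, 0.5]))
```

Note that a payoff exactly equal to c counts as not exceeding it, in line with the definition F(x) = P[X <= x].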
6. Mathematical formulation of the solution under Criterion B.

The procedure for finding an optimal strategy under Criterion B is
somewhat more complicated than it is under Criterion A. We use a linear
programming approach [5], [6]. The problem turns out to be of the
"maximization" variety, which entails

(1) maximizing a linear form \(\sum_i w_i t_i\) in non-negative variables
\(t_i\), subject to

(2) a set of linear constraints (inequalities) on the variables \(t_i\).

Once such a problem has been correctly formulated, the set¹ of values
of the variables \(t_i\) that maximizes the linear form may be found
(provided a set satisfying the constraints exists) by the "Simplex
Method" [5], [6], which may be regarded as a general algorithm for
solving linear programming problems. In any particular case, the amount of
computation involved may be enormous, so that a high-speed computer may
be needed to get actual numerical results. Conceptually however, the
solution is always computable when it exists.

1. Not necessarily unique.

In Appendix 5 of their book "Games and Decisions", Luce and Raiffa
give an excellent formulation of the ordinary two-person zero-sum game
solution as a linear programming problem. The problem here turns out to
be of a very similar form, so the reader might wish to refer to Luce and
Raiffa before going on.

Using the same notation as before, the solution is developed in
nine steps as follows:

(1) Let F(b) have dimensions m x n, and consider the matrix equation

\[ F(b)'Q = B . \]
The n x 1 matrix B is defined by this equation, and consists of elements
\(B_j\) (j = 1, 2, ..., n) with

\[ B_j = \sum_{i=1}^{m} q_i F_{ij}(b) . \]

It is easy to see that \(B_j\) is the probability that \(Y \le b\), when
player I uses mixed strategy Q and player II uses his j-th pure strategy.
Note that \(B_j \ge 0\) for all j.

(2) Define

\[ B_{j_0} = \max_j B_j . \]

Player I wishes to maximize P[Y > b], and therefore to minimize
\(P[Y \le b]\). Hence, under our assumption about the behavior of player II
which was previously stated, the objective of player I becomes: choose Q so
as to minimize \(B_{j_0}\).

(3) Define the m x 1 matrix \(U = (1/B_{j_0})Q\), and define \(1_n\) to be
an n x 1 matrix, each element of which is unity. Then we have

\[ (1/B_{j_0})B \le 1_n , \]

and consequently,

\[ F(b)'U \le 1_n . \]

(4) We can denote the matrix U by \(U' = (u_1, u_2, \ldots, u_m)\), where
\(u_i = q_i / B_{j_0}\), i = 1, 2, ..., m. Note that \(u_i \ge 0\) for all
i. The \(u_i\) are the non-negative variables of this linear programming
problem.

(5) Next, note that

\[ \sum_{i=1}^{m} u_i = \sum_{i=1}^{m} (q_i / B_{j_0}) = 1/B_{j_0} . \]

This can also be written as

\[ 1_m'U = 1/B_{j_0} , \]

where \(1_m\) is an m x 1 matrix of ones. Since player I wishes to minimize
\(B_{j_0}\), he therefore will maximize \(1/B_{j_0}\), which is equivalent
to maximizing the linear form \(1_m'U\).

(6) Thus, \(1_m'U\) is the linear form in non-negative variables to be
maximized, and from step (3) above, the matrix equation
\(F(b)'U \le 1_n\) represents a set of linear constraints.

(7) There is, however, another set of linear constraints to be
considered. Recall that under Criterion B, we demand \(P[Y \le c] \le a\),
against each pure strategy of player II. The matrix equation which
expresses this requirement is

\[ F(c)'Q \le a\,1_n . \]

We need to include this equation among the constraints. To do this,
recall that

\[ 1/B_{j_0} = 1_m'U . \]

Therefore,

\[ F(c)'Q\,(1/B_{j_0}) \le a\,1_n (1_m'U) , \]

or

\[ F(c)'U \le a\,1_{mn}'U , \]

where \(1_{mn}\) is an m x n matrix, each element of which is unity. Now
define \(0_n\) to be an n x 1 matrix of zeros, and rewrite the last
equation as

\[ [F(c)' - a\,1_{mn}']\,U \le 0_n . \]

(8) Thus far, then, we have the problem

maximize \(1_m'U\),

subject to

\[ F(b)'U \le 1_n \quad \text{and} \quad [F(c)' - a\,1_{mn}']\,U \le 0_n . \]

The two equations expressing the constraints can be combined into one
matrix equation. Define two partitioned matrices A and C as follows:

\[ A' = \begin{bmatrix} F(b)' \\ F(c)' - a\,1_{mn}' \end{bmatrix}
   \quad \text{and} \quad
   C = \begin{bmatrix} 1_n \\ 0_n \end{bmatrix} . \]

(9) The linear programming problem may now be written in final form
as

maximize \(1_m'U\), subject to \(A'U \le C\).

Of course, once U has been determined, player I's optimal strategy
is found by computing \(1/B_{j_0} = \sum_i u_i\), so that
\(Q_{optimal} = B_{j_0}U\).

In conclusion, note that under Criterion B all the information
necessary to construct the matrix A is assumed given, so that only the
computational problem remains. Furthermore, one can determine whether or
not a solution exists by computing \(\min_Q \max_P Q'F(c)P\). If this
quantity is greater than a, it is obvious that there can be no solution;
conversely, if it is smaller than a, there must be a solution.
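The following sketch works a small instance of Criterion B. It deliberately does not implement the simplex method: for a game with m = 2 rows it scans player I's mixed strategies on a grid, discards those that violate the risk constraint, and keeps the one with the smallest worst-case P[Y <= b]. The matrices `Fb` and `Fc` (standing for F(b) and F(c)), the figure `a`, and the grid resolution are all assumptions for the example. The last lines check that the resulting U = Q / B_j0 satisfies the relations of steps (3), (5) and (7).

```python
def column_probs(Fmat, Q):
    """F'Q: for each pure strategy j of player II, sum over i of q_i F_ij."""
    n = len(Fmat[0])
    return [sum(Q[i] * Fmat[i][j] for i in range(len(Q))) for j in range(n)]

def criterion_b_grid(Fb, Fc, a, steps=1000):
    """Brute-force search over Q = (q, 1 - q); valid only for m = 2 rows."""
    best = None
    for k in range(steps + 1):
        Q = [k / steps, 1.0 - k / steps]
        if max(column_probs(Fc, Q)) > a:      # risk constraint F(c)'Q <= a 1_n
            continue
        B_j0 = max(column_probs(Fb, Q))       # worst-case P[Y <= b]
        if best is None or B_j0 < best[1]:
            best = (Q, B_j0)
    return best                               # None if no feasible Q on the grid

# Hypothetical inputs: entries are P[X_ij <= b] and P[X_ij <= c], with c < b.
Fb = [[0.20, 0.50],
      [0.70, 0.40]]
Fc = [[0.15, 0.30],
      [0.05, 0.02]]
a = 0.15

Q_opt, B_j0 = criterion_b_grid(Fb, Fc, a)
U = [q / B_j0 for q in Q_opt]                 # variables of the linear program

print("Q_optimal ~", Q_opt, " worst-case P[Y <= b] ~", B_j0)
print("objective 1_m'U =", sum(U), " (equals 1/B_j0 =", 1.0 / B_j0, ")")
print("F(b)'U =", column_probs(Fb, U))        # each entry should be <= 1
```

With these numbers the risk constraint is binding: without it player I would mix evenly, but the limit on P[Y <= c] forces a smaller weight on the first row. In practice the linear program of step (9) would be handed to a linear programming solver instead.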

7. Extension to the case of a real payoff which is not one-dimensional.

In the previous sections we considered games whose outcomes were
measurable in terms of a single real commodity, e.g., money. Let us now
consider a case which, if not completely general, surely possesses enough
generality to be of practical use.

Suppose that a game can result in any one of r different outcomes, or
"states" as we shall call them. Denote these states by \(S_j\),
j = 1, 2, ..., r. \(S_i\) must be distinguishable from \(S_j\), for all
\(i \ne j\).

Next, suppose that player I can make a complete and transitive ordering
of the states in terms of preference, where the word is used in its
everyday sense. By this we mean

(1) For any two states \(S_i\) and \(S_j\), \(i \ne j\), player I either
prefers \(S_i\) to \(S_j\), or vice versa.

(2) If player I prefers \(S_i\) to \(S_j\), and \(S_j\) to \(S_k\), then he
prefers \(S_i\) to \(S_k\).

When player I is indifferent between \(S_i\) and \(S_j\), i.e., has equal
preference for them, we interpret this to mean that the relations
"\(S_i\) preferred to \(S_j\)" and "\(S_j\) preferred to \(S_i\)" hold
simultaneously. However, in order to simplify notation, we shall assume a
strict ordering exists, with no cases of indifference. This assumption does
not affect the validity of what follows; the case of indifference can be
included at the expense of some extra symbols.

Now, let the states be re-numbered in ascending order of preference,
that is, \(S_1\) is the least preferred state, \(S_2\) the next-to-least
preferred, and so on until \(S_r\) is the most preferred. During this
process, we retain the identity of each state, that is, we still know what
real outcome each of the symbols \(S_j\) represents.

Next we define S to be the ordered set \(S = \{S_1, S_2, \ldots, S_r\}\) of
the re-numbered states. It is important to note that the elements of S
are disjoint.

The purpose of all this is to lead to the following function. We
define X as a function with domain S and range the set of integers
\(\{1, 2, \ldots, r\}\), such that

\[ X(S_j) = j \quad \text{for all } S_j \in S . \]

The function X is of course a random variable defined on the "sample 
space" S. Inasmuch as the probability distribution over S is known* for 
each choice (i,j) of pure strategies, the distribution of X is also known. 
Let us illustrate this fact. Suppose that corresponding to the choice 
(i,j) of pure strategies, the distribution of the outcome was 

outcome is S^] = p^ for all 




** 0 elsewhere. 

The corresponding distribution of X is given by 

P i j [x-k] * p k for k - 1,2, ... ,r 

■ 0 elsewhere. 

Hence, given the matrix FCb^), k « 1,2, ...,r, of distribution functions 
(which corresponds to the matrix F(x) of previous sections), we can immed- 

ft 

lately construct the matrix F (k), k 8 l,2,...,r, the elements of which 
are the distribution functions of X, corresponding to the pairs (i,j) of 
pure strategies. 

Since we are dealing with a discrete random variable, we can also con- 

ft 

struct the matrix of probability mass functions which corresponds to F (k). 
Denote this matrix of mass functions by J(k). 

Now we can repeat essentially all the computations of previous sections.
For instance, if Q and P are the mixed strategies of the players, then (in
terms of player I's preferences)

$$\begin{aligned}
P[\text{outcome is preferred to } S_j] &= P[\text{outcome is } S_{j+1} \text{ or } S_{j+2} \text{ or } \ldots \text{ or } S_r] \\
&= P[X = j+1] + P[X = j+2] + \ldots + P[X = r] \\
&= Q'J(j+1)P + Q'J(j+2)P + \ldots + Q'J(r)P \\
&= Q'[J(j+1) + J(j+2) + \ldots + J(r)]P .
\end{aligned}$$

* By hypothesis; see p. 2.



Another example:

$$\begin{aligned}
P[\text{outcome is either } S_i \text{ or } S_j \text{ or } \ldots \text{ or } S_k] &= P[X = i] + P[X = j] + \ldots + P[X = k] \\
&= Q'[J(i) + J(j) + \ldots + J(k)]P .
\end{aligned}$$

We shall not pursue the details, but it should be clear, and the reader 
can easily verify, that everything goes through as before, including the 
solutions under Criteria A and B. 
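The two probability computations above can be sketched in Python as follows. The 2 x 2 game, the mass functions, and the mixed strategies are hypothetical values chosen for illustration, and the helper names `prob_states` and `prob_preferred_to` are not from the paper. Q is taken to weight the rows and P the columns, matching the form Q'J(k)P.

```python
from fractions import Fraction as Fr

# Hypothetical data (assumed): mass[i][j][k-1] = P_ij[X = k], r = 3 states.
mass = [
    [[Fr(1, 2), Fr(1, 4), Fr(1, 4)], [Fr(0), Fr(1, 2), Fr(1, 2)]],
    [[Fr(1, 4), Fr(1, 4), Fr(1, 2)], [Fr(3, 4), Fr(1, 4), Fr(0)]],
]
Q = [Fr(1, 2), Fr(1, 2)]  # assumed mixed strategy over rows
P = [Fr(1, 4), Fr(3, 4)]  # assumed mixed strategy over columns


def prob_states(states):
    """P[X in states] = Q' [sum of J(k) for k in states] P."""
    total = Fr(0)
    for i, q in enumerate(Q):
        for j, p in enumerate(P):
            total += q * p * sum(mass[i][j][k - 1] for k in states)
    return total


def prob_preferred_to(j, r=3):
    """P[outcome is preferred to S_j] = P[X = j+1] + ... + P[X = r]."""
    return prob_states(range(j + 1, r + 1))
```

With these assumed numbers, `prob_preferred_to(1)` evaluates to 5/8, and `prob_states([1, 2, 3])` evaluates to 1, as it must for a probability distribution.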

Note that we have said nothing about the nature of the states $S_j$.
They may be defined in any way at all, quantitatively or qualitatively.
We have required only that they be distinguishable, and that player I must
have a transitive preference ordering over them.

8. Some comments concerning the distribution function G(y).



In section 2 we derived the distribution function

$$G(y) = \sum_{i,j} q_i p_j F_{ij}(y) = Q'F(y)P .$$

It seems worthwhile to point out that G(y) represents a special case
of an interesting class of distribution functions called "mixture
distributions". For an excellent discussion of this type of function, and
some of its history, see [7]. Generally speaking, a distribution of this
type arises when a sample is to be drawn from one of several populations,
but only a probability statement can be made as to exactly which one.

To illustrate one of the interesting properties of mixture distributions,
let us find the mean of G(y):

$$E(Y) = \int_{-\infty}^{\infty} y \, dG(y) = \sum_{i,j} q_i p_j \int_{-\infty}^{\infty} y \, dF_{ij}(y) = \sum_{i,j} q_i p_j \mu_{ij} ,$$

where $\mu_{ij} = E(X_{ij})$. If $M = \|(\mu_{ij})\|$, then

$$E(Y) = Q'MP .$$

It is easy to show that, if $M_n = \|(E(X_{ij}^n))\|$, then

$$E(Y^n) = Q'M_nP .$$

Consequently, the variance of G(y) is given by

$$\mathrm{Var}(Y) = E(Y^2) - E^2(Y) = Q'M_2P - (Q'MP)^2 .$$

One may wish to ask, "If the distribution of Y can be written in terms
of the distributions of the $X_{ij}$ as shown, what is the functional
relationship between Y and the $X_{ij}$?". This can lead to some confusion
unless it is realized that Y is defined by the equation
$G(y) = P[Y \le y] = Q'F(y)P$. No other relation between Y and the
$X_{ij}$ is needed.
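The moment formulas for the mixture can be checked numerically. The sketch below assumes a hypothetical 2 x 2 game in which each payoff distribution is discrete; all numbers and the names `dist`, `moment_matrix`, and `bilinear` are assumptions for illustration. It computes the mean as Q'MP and the variance as the second-moment form minus the squared mean, then compares both with values obtained directly from the mixture's own mass function.

```python
from fractions import Fraction as Fr

# Hypothetical data (assumed): dist[i][j] maps each payoff value to its
# probability under the pure-strategy pair (i, j).
dist = [
    [{0: Fr(1, 2), 4: Fr(1, 2)}, {1: Fr(1)}],
    [{2: Fr(1, 4), 6: Fr(3, 4)}, {0: Fr(1, 3), 3: Fr(2, 3)}],
]
Q = [Fr(1, 3), Fr(2, 3)]  # assumed mixed strategy over rows
P = [Fr(1, 2), Fr(1, 2)]  # assumed mixed strategy over columns


def moment_matrix(n):
    """Matrix whose (i, j) entry is the n-th moment of the (i, j) payoff."""
    return [[sum(p * x**n for x, p in cell.items()) for cell in row]
            for row in dist]


def bilinear(Q, M, P):
    """Evaluate the bilinear form Q'MP."""
    return sum(Q[i] * M[i][j] * P[j]
               for i in range(len(Q)) for j in range(len(P)))


mean_Y = bilinear(Q, moment_matrix(1), P)
var_Y = bilinear(Q, moment_matrix(2), P) - mean_Y**2

# Cross-check: build the mass function of Y directly as the mixture.
mix = {}
for i, q in enumerate(Q):
    for j, p in enumerate(P):
        for x, px in dist[i][j].items():
            mix[x] = mix.get(x, Fr(0)) + q * p * px
direct_mean = sum(x * p for x, p in mix.items())
direct_var = sum((x - direct_mean)**2 * p for x, p in mix.items())
```

For these assumed numbers both routes give a mean of 17/6 and a variance of 173/36. The cross-check treats Y as a draw from population (i, j) with probability $q_i p_j$, which is the mixture interpretation described in the text.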

9. Conclusion.

There are two major conclusions which, it is hoped, the reader will 
have drawn from this paper. They are 

(1) A major practical stumbling block in the application of Game 
Theory to real problems is the unavailability of linear utility functions. 

We have seen how necessary a knowledge of these functions is, when it comes 
to making actual computations in order to solve a matrix game in the "usual" 
sense. 

(2) Despite this difficulty, there are reasonable procedures which can
be adopted, provided circumstances warrant, in order to define and compute
"optimal" solutions.

This paper is intended only to illustrate some ideas, and not to make
a complete analysis of any particular class of solutions. We hope the
reader will agree that the general scheme discussed here has possibilities
for use in many situations other than the ones we have looked at.
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APPENDIX A 



AXIOMATIC DEVELOPMENT OF THE LINEAR UTILITY FUNCTION 



Chapter 2 of "Games and Decisions" by Luce and Raiffa gives an
excellent presentation of this subject, and the reader is referred to
their book for a detailed discussion of the axioms which follow.

Definitions and notation. Assume all gambles or "lotteries" under
discussion involve a finite set of prizes or alternatives,
$A = \{A_1, A_2, \ldots, A_r\}$. A simple lottery is defined to be a chance
mechanism which associates with each alternative $A_i$ a known probability
$p_i = P_L(A_i)$, $i = 1, 2, \ldots, r$, $\sum_i p_i = 1$. A simple lottery
L is thus a probability distribution over A, and may also be regarded as an
r-tuple, $L = (p_1 A_1, p_2 A_2, \ldots, p_r A_r)$.

A compound lottery is a two-stage chance mechanism, which, with
probability $q_i$, yields a simple lottery $L_i$ as the result of its first
stage ($i = 1, 2, \ldots, s$; $\sum_i q_i = 1$). The simple lottery then
constitutes the second stage. We symbolize a compound lottery by

$$\mathcal{L} = (q_1 L_1, q_2 L_2, \ldots, q_s L_s) .$$

Note that $\mathcal{L}$ is again no more than a probability distribution
over A, where

$$P_{\mathcal{L}}(A_j) = \sum_{i=1}^{s} q_i P_{L_i}(A_j) .$$

The purpose is to derive the utility function of an individual.
When we speak of preferences, they are the preferences of that individual.

The symbol $\succsim$ expresses a preference ordering. That is,
$A_i \succsim A_j$ means that the individual prefers $A_i$ to $A_j$ or is
indifferent between them, and $L \succsim L'$ or $\mathcal{L} \succsim L$
expresses the same relationship between two lotteries.

The symbol ~ denotes indifference between the two alternatives.

Throughout what follows, it is assumed that the set A has been
ordered so that $A_1$ is the most preferred alternative, and so on down
until $A_r$ is the least preferred. This can be done with no loss of
generality.



Axiom 1. The preference ordering $\succsim$, a binary relation, yields a
complete and transitive ordering of A.

"Complete" means, for any $A_i$ and $A_j$ in A, either $A_i \succsim A_j$
or $A_j \succsim A_i$.

Axiom 2. Let $\mathcal{L}$ be a compound lottery and L a simple lottery. If
$P_L(A_i) = P_{\mathcal{L}}(A_i)$ for all $A_i$ in A, then $\mathcal{L} \sim L$.

This says that it is only the character of a lottery as a probability
distribution which interests the individual.
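A minimal sketch of the reduction implied by Axiom 2 follows. It assumes the usual rule that a compound lottery assigns to each alternative the first-stage-weighted sum of the second-stage probabilities. The function name `reduce_compound` and all numbers are hypothetical.

```python
from fractions import Fraction as Fr


def reduce_compound(stage_probs, simple_lotteries):
    """Return the simple lottery with the same distribution over A.

    stage_probs[i] is the first-stage probability q_i of drawing the
    i-th simple lottery; simple_lotteries[i][j] is the probability that
    lottery i assigns to alternative A_(j+1).
    """
    r = len(simple_lotteries[0])
    return [sum(q * lottery[j] for q, lottery in zip(stage_probs, simple_lotteries))
            for j in range(r)]


# Hypothetical example with three alternatives and two second-stage lotteries.
q = [Fr(1, 4), Fr(3, 4)]
L1 = [Fr(1, 2), Fr(1, 2), Fr(0)]
L2 = [Fr(0), Fr(1, 3), Fr(2, 3)]
reduced = reduce_compound(q, [L1, L2])
```

Here `reduced` is the simple lottery (1/8, 3/8, 1/2), to which the individual is indifferent under Axiom 2.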

Axiom 3. Each alternative $A_i$ is indifferent to some simple lottery
which mixes only $A_1$, the best alternative, and $A_r$, the worst
alternative. That is, for every $A_i$ in A, there exists some probability
$u_i$, such that

$$A_i \sim [u_i A_1, (1 - u_i) A_r] .$$

Axiom 4. If $A_i \sim [u_i A_1, (1 - u_i) A_r]$, then in any simple lottery
$L = (p_1 A_1, \ldots, p_i A_i, \ldots, p_r A_r)$, $p_i A_i$ may be replaced
by $p_i [u_i A_1, (1 - u_i) A_r]$.

Axiom 5. The relations $\succsim$ and $\sim$ are transitive among lotteries,
just as they are among the elements of A.

This is obviously desirable, but it cannot be deduced from the
preceding axioms, and so must be included separately.

Axiom 6. The relation $[p A_1, (1 - p) A_r] \succsim [p' A_1, (1 - p') A_r]$
holds if and only if $p \ge p'$.

These six axioms give us the following theorem, which is the point
of the entire development:

Theorem. If the relation $\succsim$ satisfies axioms 1 through 6, then there
exist numbers $u_i$ associated with the elements of A, such that for
any two lotteries L and L', the relation $L \succsim L'$ holds if and only if

$$\sum_{i=1}^{r} u_i P_L(A_i) \ge \sum_{i=1}^{r} u_i P_{L'}(A_i) .$$

The proof is obvious from the axioms. Any compound lottery can be
reduced to a simple lottery involving $A_1, \ldots, A_r$, which can in turn
be converted to another compound lottery involving just $A_1$ and $A_r$,
which can be reduced to a simple lottery involving $A_1$ and $A_r$, which
can by axiom 6 be compared with any lottery so treated. The "experimental"
part of the program lies in finding the numbers $u_i$ of Axiom 3. These
first appear in the guise of probabilities, and later turn out to be the
utilities which we sought. It is easy to show that any positive linear
transformation of these numbers again yields a utility function which
reflects the individual's preferences.
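The comparison rule of the theorem can be sketched as below. The utilities `u` are hypothetical values of the kind Axiom 3 would supply (1 for the best alternative, 0 for the worst), and the function names are assumptions. The last lines illustrate that a positive linear transformation of the utilities leaves the ordering of two lotteries unchanged.

```python
from fractions import Fraction as Fr

# Hypothetical utilities for three alternatives, best first (assumed values).
u = [Fr(1), Fr(3, 5), Fr(0)]


def expected_utility(lottery, utilities=u):
    """Sum over i of u_i * P_L(A_i) for a simple lottery given as a list."""
    return sum(ui * pi for ui, pi in zip(utilities, lottery))


def weakly_preferred(L, L_prime, utilities=u):
    """True when L is preferred or indifferent to L_prime per the theorem."""
    return expected_utility(L, utilities) >= expected_utility(L_prime, utilities)


L = [Fr(1, 2), Fr(0), Fr(1, 2)]   # even chance of best and worst
L_prime = [Fr(0), Fr(1), Fr(0)]   # middle alternative for certain

# A positive linear transformation a*u + b with a > 0 (here a = 2, b = 5).
v = [2 * ui + 5 for ui in u]
same_order = weakly_preferred(L_prime, L, u) == weakly_preferred(L_prime, L, v)
```

With these assumed utilities the certain middle alternative has expected utility 3/5 against 1/2 for the even gamble, so `L_prime` is preferred under both `u` and `v`.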

The reader who has followed through this derivation should easily 
be able to verify the assertions made in section 3 of this paper. 
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APPENDIX B 



A NON-RIGOROUS EXTENSION OF THE LINEAR UTILITY CONCEPT. 

In Appendix A, the assumption was made that the set A of alternatives
associated with any lottery L was finite, and that the lottery comprised
a probability mass function $P_L(A_i)$ defined on A. Suppose now we wish
to enter a gamble G involving some numerically measurable and infinitely
divisible commodity x. Associated with G there is a probability density
function $f_G(x)$, defined over x.

We want to make all the logic and conclusions of Appendix A valid 
for this case also, so we take the following position. All "continuous" 
gambles discussed in this paper are such that 

(1) If I is the interval over which x is defined, I may be partitioned
into a finite number of disjoint subintervals $dx_i$, whose union
is I. If X is the random variable of the gamble,
$P[X \in dx_i] = f_G(x_i)\,dx_i$, where $x_i \in dx_i$.

(2) By proper choice of $dx_i$ and $x_i$, we can approximate $f_G(x)$ as
closely as we like by a probability mass function $f_G(x_i)\,dx_i$.

Once this has been done, all the logic of Appendix A will go through,
and the numbers $u_i$ can be found. We now assume that the set of numbers
$u_i$ thus produced can be represented, as $dx_i \to 0$ and the number of
subintervals increases without limit, by a continuous curve u(x). When
this is the case, it is intuitively clear that the continuous expected
value calculation will have the same characteristics as the discrete
calculation in Appendix A.
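The limiting argument can be illustrated numerically. The sketch below assumes a hypothetical density f_G(x) = 2x on the interval [0, 1] and a hypothetical utility curve u(x) = sqrt(x); neither is taken from the paper. It partitions the interval into n equal subintervals, uses the midpoint of each as the representative point, and forms the discrete expected utility. For this assumed pair the continuous expected utility is 4/5.

```python
import math


def f_G(x):
    """Hypothetical probability density on I = [0, 1] (an assumption)."""
    return 2.0 * x


def u(x):
    """Hypothetical continuous utility curve (an assumption)."""
    return math.sqrt(x)


def discrete_expected_utility(n):
    """Expected utility under the n-cell mass function f_G(x_i) * dx_i."""
    dx = 1.0 / n
    total = 0.0
    for i in range(n):
        x_i = (i + 0.5) * dx      # representative point in the i-th subinterval
        mass_i = f_G(x_i) * dx    # approximate probability of the subinterval
        total += u(x_i) * mass_i
    return total


coarse = discrete_expected_utility(10)
fine = discrete_expected_utility(1000)
```

As n grows, the discrete value approaches the integral of u(x) f_G(x) over [0, 1], which is the sense in which the continuous calculation inherits the properties of the discrete one.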
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